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Abstract:  We  develop  a general  approximation  ^ "'’work  for  use  in 
optimal  control  problems  governed  by  nonlinear  anctional  differ- 
ential equations.  Our  approach  entails  only  the  use  of  linear 
semigroup  approximation  results  while  the  nonlinearities  are 
treated  as  perturbations  of  a linear  system.  Numerical  results  are 
presented  for  several  simple  nonlinear  optimal  control  problem 
examples. 

Keywords:  Functional  differential  equations,  nonlinear  control 
problems,  semigroups,  approximation  techniques. 
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1 . Introduction 

The  purpose  of  this  paper  is  to  extend  and  provide  proofs 
for  the  theoretical  results  announced  earlier  in  Refs.  1 and  2 and 
to  demonstrate  that  the  approximation  method  thus  proposed  is  a 
reasonable  one  for  certain  classes  of  nonlinear  functional 
differential  equation  (FDE)  control  problems  by  reporting  some 
of  our  numerical  experience.  The  idea  (which  is  fundamental  to 
our  efforts  here)  of  approximating  infinite  dimensional  FDE 
systems  by  finite  dimensional  ordinary  differential  equation  (ODE) 
systems  is  not  a new  one  and  the  reader  should  consult  Ref.  3 for  a 
fairly  complete  summary  of  the  previous  literature  on  this  idea. 

In  the  presentation  to  follow  we  shall  develop  a theoretical 
framework  for  nonlinear  control  problems  and  then  offer  sample 
numerical  results  for  several  simple  examples.  In  developing 


the  theoretical  foundations  in  sections  2 and  3,  we  shall  demonstrate 
that,  for  a reasonably  generous  class  of  interesting  nonlinear 


systems,  approximation  ideas  involving  only  linear  semigroup 
theory  (developed  previously  in  Refs.  3 and  4 ) are  adequate  for 
development  of  an  abstract  nonlinear  approximation  framework. 
These  ideas  are  not  at  all  technically  complicated  but  rather 
rely  only  on  quite  standard  techniques  from  known  ODE  and  FDE 
theory  plus  frequent  use  of  the  Gronwall  inequality. 


Among  other  authors  (Refs.  5-9)  who  have  developed 
directly  related  theoretical  results  for  the  approximation  of 
nonlinear  FDE  systems,  only  Repin  does  not  make  use  of  the 
machinery  of  nonlinear  semigroup  theory.  But  the  assumptions  in 
the  Repin  presentation  lead  to  results  that  are  not  directly 
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applicable  to  control  problems.  Thus  while  there  is  some  overlap 
between  our  results  and  those  found  in  each  of  the  references 
cited  above,  our  treatment  differs  from  each  of  those  in  some 
essential  aspects  and  it  is  accurate  to  say  that  our  results 
neither  completely  subsume  nor  are  they  subsumed  by  the  results 
(individually  or  collectively)  of  these  authors. 

A feature  of  our  presentation  is  the  simplicity  of  the 
ideas  and  proofs  involved,  given  the  linear  theory  of  Ref.  3. 

Indeed,  one  should  note  the  similarity  between  the  classical 
techniques  for  existence,  uniqueness  and  continuous  dependence 
results  used  in  section  2 (one  could,  of  course,  instead  use 
there  standard  - and  perhaps  more  elegant  - fixed  point  type 
arguments)  and  those  employed  in  the  approximation  arguments  of 
section  3.  A disadvantage  of  our  formulation  in  this  paper  is 
that  the  present  assumptions  allow  one  to  include  discrete  delay 
terms  - x(t-r)  terms  as  opposed  to  x^  terms  - only  in  the  linear 
part  of  the  control  system  equations.  We  are  currently  investigating 
to  what  extent  our  ideas  can  be  further  extended  to  treat  general 
nonlinear  differential-difference  equation  control  systems. 

The  notation  employed  in  the  sequel  is  quite  standard. 

Lp([a,bl,X)  will  denote  the  normed  space  of  X-valued  Lp  "functions" 

(we  shall  not  distinguish  between  representatives  and  equivalence 
classes  since  the  meaning  will  always  be  apparent  to  the  reader) 
defined  on  la,bl  and  Lp(a,b)  (or  Lp(a,b)  if  v*l)  will  sometimes 
be  used  when  X > where  R^  is  Euclidean  space.  The  class  of 
functions  f with  |f|^  locally  integrable  will  be  denoted  by 


> 


L and  those  with  p continuous  derivatives  will  be  denoted  by 
P 

Throughout,  the  space  Z = R”xL2(-r,0)  will  be  used  with 

2 2 2 

the  usual  product  topology,  i.e.  if  z = (ri,<(i)GZ,  |z|  = |n|  + 1 (|)  1 

Here,  as  elsewhere,  the  general  symbol  | • | is  used  for  the  norm 
when  no  confusion  will  result.  That  is,  unless  otherwise  stated 


explicitly,  |x|  denotes  the  norm  of  xGX  where  the  norm  is  the 


one  understood  for  the  space  X.  The  space  of  continuous  functions 

on  [a,bl  into  Z with  the  usual  supremum  norm  will  be  denoted  by 

i^([a,b],Z)  and  AC*'[a,b]  will  be  the  space  of  p'^-valued  absolutely 

continuous  functions  on  Ia,b]  (with  the  supremum  norm,  i.e.  ^ 

topology,  unless  otherwise  specified) . The  vector  space  of  n^m 

matrices  will  be  written  as  . Finally  for  a given 

n , m 

measurable  function  s -+•  x(s),  the  symbol  x^  denotes  the 
measurable  function  on  [-r,0]  given  by  0 -+  x^(0)  = x(t+0), 

-r  £ 0 £ 0.  For  vectors  we  shall  not  in  general  distinguish 
between  the  column  form  and  its  transpose  when  the  usage  makes 
clear  our  intended  meaning. 


I 
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2.  Abstract  Formulation  of  the  FDE  Control  System 


We  shall  consider  in  the  following  discussions  the  basic 


nonlinear  control  system 


x(t)  = L(x^)  + f (t,x(t)  ,x^,u(t) ) , tG  [0,tj^]  , 
x(0)  = = <<>  » 


where  tj^  is  finite,  (n»<J>)€Z,  and  the  linear  part  L is  as  given  in 

Ref.  3.  That  is,  there  are  matrices  and 

i n,n 

D G L- ( [-r , 0] , ^ ),  and  numbers  h . , 0 = h„  < h,  < . . . < h = r, 

^ n^n  X ox  V 

such  that 

r 

L(<(>)  = I A^(|)(-h^)  + D(0)({.(e)d0 

i=o  ' 

-r 

for  41  G '>2 0)  (where,  of  course,  the  proper  interpretation  must 
be  given  to  the  point  evaluations  of  (|»  if  this  "function"  is 
indeed  only  an  L2  "function")  . The  nonlinearities  of 

the  system  are  contained  in  the  mapping  f (-r,  0)  xR*”  -►  r”, 

about  which  we  make  several  fundamental  standing  assumptions  for 
the  presentations  in  this  paper: 

(HI)  The  mapping  (t,y,ip,v)  -►  f(t,y,i)),v)  is  continuous  on 

R^xR"xL2(-r,0)  xr”*. 

(H2)  For  any  bounded  subset  ^ of  Z there  exist  m^^  = , 

m,  i«l,2,  such  that  for  vGr"*,  tGR^,  and 

(x,(|») , (y,ij))G^  one  has 

|f  (t,x,4»»v)-f  (t,y,i|;,v)  I < {mj(t)+m2(t)  |v|  }{|x-y|  + |4)-i)i|  }. 
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(Ji3)  There  is  a continuous  mapping  t such  that 

It  f lU 

f(t,0,0,v)  = B(t)v  for  all  (t , v)  G R^xr"'.  Further,  there 

loc 

exist  functions  m.GL  , i=l,2  such  that 

1 f (t,x,4),  v)  1 £ {mj^  (t) +m2  (t)  jv]  >{  1x1  + 1 (|)  1 } + lB(t)llvl 

for  all  t,v  and  all  (x,<j))  with  l(x,((i)|2  sufficiently  large. 

Inn  X 

(H4)  There  is  a continuous  function  g:  R'‘'xr”xL2  (“C , 0)  ->  R"'’  such 

that 

1 f (t,x,  ((>,v)-f  (t,x,(<),u)  1 £ g(t,x,(ti)  |v-ul 
for  all  (t,x, (())  G R^xr”xL2  (-r, 0)  and  V/UGr"*. 

Remark  2.1.  We  observe  that  hypotheses  (H2)  and  (H3)  together 
yield  that  the  following  condition  is  satisfied  by  f: 

(G)  There  exist  mj^,m2  in  such  that  1 f (t,y,  i|),v)  | £ 

{mj^(t)+m2(t)  |v| }{ lyl  + UI  } + |B(t)||v|  for  all  (t,y,4i,v)  in 
R^xR"xL2(-r,0)  xr"*. 

Let  coordinate  projections  of  Z = R^xL^C-tjO) 

onto  R**  and  L2  respectively.  That  is,  Trj^(n,^)  = n,  '>t2(n,<(»)  = <!>• 
Define  a mapping  FzR^xzxr"*  z by 

F(t,z,v)  = (f  (t,iTj^z,ir2Z,v)  ,0)  . 

Several  properties  of  the  mapping  F will  be  used  in  the  sequel  and 
we  list  these  here  for  later  reference. 

Pj^!  For  any  bounded  subset  ^ of  Z there  exist  (depending 

on  in  such  that 

CP 

|P(t,z,v)-F(t,w,v) I < {Mj (t)+M2 (t) |v| } |z-w| 
for  all  z,wG^  and  tGR^,  vGR™. 
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P2’  For  any  z€  i^(  [0,  t^^l,  Z)  and  u6L2{0,tj^),  the  mapping 
t |F(t,z(t)  ,u(t)  ) I is  in  Lj^(0,tj^). 

Property  Pj^  follows  directly  from  (H2)  while  P2  is  a consequence 
of  (H2)  and  (H3). 

We  introduce  next  the  solution  semigroup  S(t):Z  Z of  the 
linear  part  of  (1)  as  used  extensively  in  the  discussions  in  Refs.  3 
and  A.  Thus  for,  (n,<t')€Z,  we  define  S(t)(ri,4')  = (x(t),x^) 
for  t ^ 0,  where  x is  the  solution  of  (1)  on  (0,®)  with  f = 0. 

It  is  easily  shown  that  {S(t)}^^Q  is  a C^-semigroup  with 
infinitesimal  generator  defined  on  = {(ri/4')|<('  is 

absolutely  continuous  with  <|)GL2(-r,0)  and  (t»(0)  = nl  by 
= (L(<p),ip).  Let  M,6  be  constants  such  that 
|s(t)  I £ II  exp(Bt)  for  t £ 0. 

We  shall  be  concerned  with  an  abstract  form  of  (1)  given 
by  the  implicit  equation 

t 

z(t)  = S(t)z^  + I S{t-a)F(a,z(a) ,u(a))da  . (2) 

o 

That  this  equation  is  in  some  sense  equivalent  to  (1)  is  the 
focus  of  the  remaining  discussions  in  this  section. 

Lemma  2.1.  Under  the  hypotheses  (H1)-(H3),  equation  (2)  defines 
for  each  uGL^COjtj^)  and  z^GZ,  a unique  function  t -►  z(t)  in 
'ir((0,tjI,Z)  . 

Proof:  For  any  two  continuous  solutions  z and  w of  (2)  on  [0,t^] 


we  have  using  Pj^  (since  {z  (t) ) , {w(t) } lie  in  a bounded  subset  of  Z) 
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|z(t)-w(t)|  < |m  exp[$(t-o)I{Mj^(o)+M2(cf)  |u(a)  I } (z(a)-w(a)  |d(j 


o 

t 


= I ^(a)  I z (a) -w(o)  Ida 


where  ^ (0,  tj^)  . Uniqueness  of  solutions  thus  follows  from  an 

application  of  Gronwall's  inequality. 

The  proof  of  existence  also  involves  only  quite  standard 
arguments.  Define  for  k=0,l,2,...  the  Picard  iterates  {z  } by 


z^(t)  = S(t)z, 


z’^(t)  = S(t)z^  + I S(t-a)F(a,z’'‘^(a)  ,u(a))da  , 


(3) 


for  t€[0,tj^].  Clearly  the  iterates  are  well-defined  and  an 
inductive  argument  employing  the  strong  continuity  of  {S{t)} 
and  property  ^ in  the  usual  manner  yield  easily  that 
z €if(  [0,  tj^] , Z)  for  ks0,l,2,...  . Using  (G)  in  the  second  equation 
of  (3)  one  obtains 

t 

Iz’^Ct)  I < M exp(6t)||zQ|  + j (2  [nij^  (a) +*2(0)  |u(o)  1 1 | z^“^  (0)  | + 1 B (o)  | | u (o)  | )do| 


< M exp(6t)  jfz^l  + Y + I >(o)  (z^"^{o)  (daj 


where  y and  Jt  €L2  are  independent  of  k.  A simple  inductive 
argument  using  this  inequality  can  be  made;  one  obtains 

|s*^(t)|  £PqM  exp(et2)exptH(t]^)] 
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t 

I 

s 

■ 

t 

» 

4 

/ 


1 

T 


t 

where  = sup{|s(t)z^|  + y : tG[0,tj^l}  and  H(t)  = H exp(8t^)  j jt  (o)da. 

o 

Thus  {z^}  is  bounded  in  [0 , tj^]  , Z)  . 

Use  of  this  boundedness  along  with  property  allows  us  to 
choose  in  L^(0,tj^)  such  that 

|F(a,z’'(cj),u(a)  )-F(o,z*'"^  (o)  ,u(o) ) | 


< {Mj^  (0) +M2  (o)  |u(o)  I } I z^(o)-z^”^  (o)  I 


for  all  k=l,2,...  . This  can  be  used  to  show  that  (z  } is  in  fact 
a Cauchy  sequence  in  ^{[0,t^],Z).  Indeed,  using  completely 
straightforward  arguments  one  can  obtain 


k+1 


V 


£ |m  exp(etj^)|  q 


ki 


where  q 


|F(o,z°(o) ,u(a) ) |do  and  u(t) 


t 

{Mj^  (o) +M2  (o)  I u (a)  I }do  . 


o o 

By  passing  to  the  limit  in  (3)  we  find  that  the  limit  z of 
the  Cauchy  sequence  {z  } is  the  desired  solution  of  (2). 

Returning  to  system  (1),  we  observe  that  for  a given 
uGL2(0,tj^)  and  (jiCAC*’ (-r , 0) , (|)(0)  = n»  standard  arguments  can  be 
employed  to  establish  (under  hypothesis  (Hl)-(H4))  existence  of  a 
unique  solution  on  finite  intervals  I0,tj^].  That  such  solutions 
are  continuous  in  (4i,u)  can  also  be  shown. 

Lertna  2.2.  Consider  the  mapping  (♦fU)  -*■  (x(t;<|>,u)  ,x^(<(>,u) ) from 
AC” (-r , 0)  ^L2  (0,  tj^)  -*■  Z where  x is  the  solution  of  (1)  corre- 
sponding to  ((|>,u)  under  hypothesis  (H1)-(H4).  This  mapping  is 
continuous. 
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Proof:  Fixing  (<ti,u)  in  AC^  (-r,0)  xL^  (0,tj^)  and  denoting  by 

x=x(i^i,u)  the  corresponding  solution  of  (1)  , we  shall  consider 
arguments  involving  y=y(i^/V),  solutions  of  (1)  corresponding  to 
(il'/V)  in  some  bounded  neighborhood  Bj^xB2  of  (il>,u)  in  (-r , 0)  xL™ (0 , tj^)  . 

Using  the  linearity  of  L,  the  growth  condition  (G) , and  Gronwall's 
inequality  along  with  quite  standard  reasoning,  it  is  easy  to  show 
that  (y  (t;  ij),  v)  1 1 E 10,  tj^ ) , G , V G B2)  lies  in  a bounded  subset  of 
That  is,  { (y(t;ii),v)  ,y^(ii',v)  ) |tE  [0,tj^]  Bj^,vEB2}  lies  in  a 
bounded  subset  of  Z so  that  the  local  Lipschitz  conditions  of  (H2) 
can  be  employed  with  the  functions  m^  independent  of  the  choice 
(ii',v)  in  Bj^xB2* 

From  (1)  we  obtain  immediately  for  x=x  ((j),u)  ,y=y  (\j>,v) 

t 

|x{t)-y(t)|  _<  I 0 (0) (0)  I + ||  L(x^-y^)da| 

o 


[f(o,x(o),x^,u{o))-f(a,y(a),y^,v(a))ldo 


= Tj+T2+T3, 


(4) 


Using  (H2)  and  (H4),  we  find 


T < 
3 - 


|||  f (o,x(o)  ,x^,u(o)  )-f  (a,x(a)  ,x^,v(o) ) I 
o 

+ |f(o,x(a),x^,v(o))-f(o,y(o),y^,v(o)  )||da 
t 

1 ||g  (o,x(o)  ,x^)  |u  (o)  -V  (0)  I +2  tmj^  (o)+m2  (0)  1 v (o)  | lj(x  (a)  ,Xg)  - (y  (o)  ,y^)  j ^jda 
o 

t 
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From  the  form  of  L we  obtain 

T2  1 [ I I A. (x(o-h^)-y (o-h. ) ) 
1 I i=o 


o 

t 


! ^ 

< I { I (A^I (x(o)-y (o) I + |d| 
• ^ 


0 

1 D(0) [x(o+0)-y(o+0) ld0 
-r 


. }do 
^2 


do 


? " 

+ I |a. I lx(o)-y(o) |do 

^ i=l  ^ 

-r 

< r(I|A^|) Ix^-y^l^  + I e| (x(a),x^)-(y(o),y^)|2do 


V 

where  E e 2{^  jA^j^j  + |d|} 
o 


Using  these  estimates  in  (4)  we  thus  have 

t 

|x(t)-y(t)|  + |g(x)||u-v|  + [ (5^^  (o) +E)  A (o)  do 

o 

where  A(o)  = | (x  (o) , x^)  - (y  (o)  , y^)  | 2 • But  since  |x^|j^  £ /r|x^| 

2 Jb 

this  implies 


t 

A(t)  £ /r+r  I rl^-tl  + |g(x)  I lu-vj  + I t5^(o)+E)  A(o)do  j . 

o 

With  an  application  of  Gronwall's  inequality  we  then  obtain 

t 

A(t)  < /1+r  |r|((i-ij)|  + |g(x)  I |u-v|  I exp  | 1 (o) +E)do| , 

o 

from  which  the  desired  continuity  follows  easily  once  one  notes 
that  the  exponential  term  is  bounded  uniformly  in  vEBj. 

Arguments  almost  identical  to  those  above  (employing  instead 
of  (H2)  along  with  Gronwall's  inequality)  can  be  used  to  establish 
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an  analogous  continuity  result  for  the  functions  defined  by  (2)  . 

We  shall  therefore  only  state  this  continuity  property,  omitting 
the  proof. 

Lemma  2.3.  The  mapping  (<ti  (0)  ,i|)  ,u)  ->  z(t;<J>,u)  where  z is  definer 
by  (2)  with  z^  = ((|i(0),(|))  is  continuous  on  in 

the  ZxL2  topology  fend  hence  in  the  R”xSfxL2  topology)  . 

The  preceding  discussions  allow  us  to  establish  with  relative 
ease  a desired  equivalence  between  systems  (1)  and  (2) . 

Theorem  2.1.  Suppose  f satisfies  (H1)-(H4).  Then  for 
= (<))  (0)  ,(J))  G and  uGL2(0,tj^)  we  have 

z(t;((),u)  = (x(t;(|),u)  ,x^(()>,u) ) 

where  t -*■  x(t;((i,u)  is  the  solution  of  (1)  and  t -*■  z(t;()),u)  is 
defined  by  (2)  with  z^  = (i^(0),(ti). 

Proof:  We  shall  say  that  t ^(t)  is  a strong  solution  (in  Z)  of 
C(t)  -M(t)  + F(t,C(t),u(t)) 

(5) 

C(0)  = (()>(0),^) 

if  t -♦  C(t)  is  continuous  for  t ^ 0,  for  t > 0 with 

C(t)Gi^(Q/)  for  t > 0 and  the  equation  in  (5)  is  satisfied  for 
t > 0. 

We  first  consider  solutions  for  ♦GC^^^  and  uGC^®^  and  define 
t -*■  w(t)  by  w{t)  = (x(t;(^,u)  ,x^(^,u) ) where  x is  the  corresponding 
solution  of  (1) . Direct  computations  for  ^ w(t)  in  the  Z-norm 
show  that  w(t)  ■-a^w(t)+F(t,w(t)  ,u(t) ) and  that  the  necessary 
continuity  requirements  obtain  so  that  t w(t)  is  a strong  solution 
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of  (5).  Next  let  t -*■  C(t)  be  any  strong  solution  of  (5) 
corresponding  to  and  define  X(o)  = S(t-o)C(o) 

for  0 £ o £ t.  We  observe  then  that  the  assumptions  on  f (see 
(HI))  imply  that  in  this  case  o ->  F (o,  ? (o)  ,u  (o) ) is  continuous. 
Again  direct  computations  (e.g.,  for  c>0,  write  it\(o+E) -X  (o)  ] = 

S ( t-o-G ) {^[^ (o+e) -C (o) 1 + ^[I-S (e) 1 ^ (o) } and  take  the  limit  as 

+ * + 

e -♦  0 ) reveal  that  the  right  derivative  X (o)  exists  and  equals  the 

continuous  function  a -*■  S (t-o)F(o,  ^ (o)  ,u(o)  ) . It  follows  in  the 

usual  manner  that  X exists  continuously  and  indeed  for  t > 0 

. t t 

X(t)-X(0)  = I X(o)do  = I s(t-a)F(a,q(o) ,u(o) )da. 

o o 

Since  X (t)  = S (0) C (t) , X (O)  = S(t)C(0)  we  thus  find  that  any 
strong  solution  C of  (5)  corresponding  to  G C ,u  G C must 
satisfy 

t 

c(t)  = s(t)^(0)  + I s(t-o)F(a,C  (o)  ,u(a)  )da  . 

o 

By  Lemma  2.1  we  see  that  any  such  strong  solution  of  (5)  must 

be  the  unique  solution  of  (2)  in  5f(  (0,  t^],  Z)  . But  we  saw  above 

that  t w(t)  = (x(t),x^)  is  a strong  solution  of  (5)  for  ♦jU 

so  restricted.  It  thus  follows  that  w(t)  = (x(t),x^)  = z(t) 

where  z is  defined  by  (2)  with  z > (^(0),<^)  and  the  desired 

o 

equivalence  is  obtained  for  ♦ G C ,u GC . 

Finally,  from  Lemmas  2.2  and  2.3  we  see  that  for  each  t 
the  mappings  ((K0),(p,u)  -►  (x(t)  ,x^) , ((p  (0)  ,^,u)  z(t)  from 
S^(Qrt  xLj  (0,  tj^) -»Z  are  continuous  ii  ttie  R^x  SfxLj  topology 
This,  combined  with  the  denseness  properties  of  the  set 


f 
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{ (♦  (0) , <>,u)  I (>  € C £ C } on  which  equivalence  has  already 

been  proven,  can  be  used  to  establish  the  equivalence  as  claimed 
in  Theorem  2.1. 
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3 . Approximation  and  convergence 

Having  shown  in  the  previous  section  that  the  control  system 

t 

z(t)  = S(t)z^  + I S(t-o)F(o,z(o) ,u(o) )da  (6) 

o 

is  equivalent  to  our  original  system  (1)  , we  turn  next  to 

approximations  of  (1)  via  approximations  of  (6).  Let 
N N N 

(S  (t)},  Z , P be  a general  approximating  scheme  as  discussed 

in  the  first  part  of  section  3 of  Ref.  3.  That  is,  {S^(t)}  is  a family 

of  approximating  C^-semigroups  satisfying  ls^(t)|  exp(6t)  and  the 

M 

convergence  criteria  S (t)2  S(t)z,  zEZ,  uniformly  in  t on 

finite  intervals.  The  subspaces  Z C Z contain  all  elements  of  the 

form  (C»0),  and  the  linear  operators  P^:Z  ■*  satisfy  the 

condition:  P z^  -►  z^  for  all  initial  data  of  interest  in  studying 

the  control  system  (6)  (or  (1)). 

An  example  of  such  an  approximating  scheme  is  the  "averaging" 

approximation  scheme  discussed  in  some  detail  in  Ref.  3 (see  also  Ref.  4) . 

While  our  discussions  in  this  section  will  deal  with  general 

approximating  schemes,  we  shall  use  the  particular  "averaging" 

scheme  of  Ref.  3 to  discuss  nixnerical  results  for  ncnlinecu:  control 

systems  in  a subsequent  section  of  this  paper. 

Given  a general  approximating  scheme  (S  (t)},  Z , P as 

described  above,  we  consider  the  approximating  system 

t 

z**(t)  « S^(t)P^z^  + I S**(t-a)F(a,z^(o)  ,u(o)  )da.  (7) 

o 

Using  arguments  exactly  as  in  the  proof  of  Lemma  2.1,  one  can  easily 
verify  that  (7)  defines,  for  each  N,  a unique  function 


A 


I 
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[0,tj),Z)  . We  shall  show  that  for  a given  u€L2{0,tj^), 
the  solutions  of  (7)  actually  converge  to  the  solution  of  (6)  . 

Lemma  3.1.  Suppose  is  a bounded  subset  of  L2(0»tj^).  The 
sequence  {z^}  defined  by  (7)  is  bounded  in  [0, tj^l  ,Z) , 
uniformly  in  uE^^. 

Proof!  We  have  immediately  from  the  assumptions  on  S**,P*^  that 

t 

I 2*^(t)  I £ M exp(et)6  + j M e3q3[6(t-o) ) |F(o,z''*(o)  ,u(o))  |da 

o 

where  6 is  independent  of  N and  u.  .... 

But  the  condition  (G)  on  f implies  the  existence  of  M, ,M-  in 

12  <*> 

such  that 


|F(t,z,v)|  < {Mj(t)+M2(t) |v| } (z|+(B(t) I |v| . 

Thus  we  find 

t 

|2’^(t)  I < M exp(6t)[6  + j |b(0)  I |u(a)  (daj 

o 

t 

+ I M e5qj[6(t-o)]{Mj^(a)4M2(a)  |u(o)  |}|z^(o)  Ida 
o 

t 

< M exp(Bt){6+|B(j^(u|j^}  +1  ky(o)(z^(o)(do, 

o 

which  upon  an  application  of  Gronwall's  inequality  yields 

Iz'^Ct)!  < M exp(et){6+(B||u|}exp  | j k^(o)doj. 

o 

Since  k^^(a)  - m exp(Bt^)  (M^(o)+M2  (o)  (u(a)  | },  it  is  clear  that 
for  u € the  right  side  of  this  last  inequality  is  bounded 
uniformly  in  u and  N. 
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r 

Lenuna  3.2.  For  bounded,  {z^}  is  Cauchy  in 

uniformly  in  uGi^. 

Proof;  For  u Gi#’ and  each  positive  integer  j we  denote  by  the 
function 

g^(a)  = f (a,Trj^z^  (a)  ,7T2Z^  (o)  ,u  (o) ) . 

Since  |g^{o)|  £ 2{mj^  (o) +m2  (o)  [u  (o)  | } | z^  (o)  | + | B (o)  | | u (o)  | 

(see  condition  (G) ) with  m^GL^°^  and  {z^}  uniformly  bounded,  it 
is  easily  argued  that  g^GL2(0,tj^)  with 


uniformly  in  uGj^’and  j=l,2,...  . 

Let  T^(t);R”  and  T(t):R"  -►  Z be  defined  by 

T**{t)C  H s”(t)(C,0),  T(t)5  = S(t)(C,0)  where  CEr". 


T(t) 
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Thus,  for  N,K  > 0 and  t £[0,tj^J  we  may  write 

|z^(t)-z**'*'‘^(t)  I < I (S*^(t)P*^-S*^‘^’^(t)P^'*'’^)z  I 

— o 

t 

+ I |s”(t-o)  (g^(o)  ,0)-S^'^^(t-o)  (g”'^^(a)  ,0)  Ido 

o 

t 

< ej^(N,K)  + I |T^(t-a)g^(o)-T^'^^(t-a)g^'^’^(o)  Ido 

o 

t 

< ej^(N,K)  + I I (T^(t-o)-T^'*'^(t-o)  )g”(o)  Ido 


o 

t 


+ I |T”'^^(t-0)  {g**(o)-g*^‘*‘^(o)  } Ido 
o 

t 

< e^(N,K)  + /eJInTIH'  Ig^lj^  + | M exp  [8  (t-o)  ] |gN(o)-g^^K(o)  |jjo 


< ^1  + /e^  ^ I J Ido, 


(8) 


where  e(N,K)  “ A -*•  0 as  N,K  ■+•  »,  uniformly  in  uG^. 

From  the  boundedness  property  of  {z^}  and  property  Pj^,  we  have 

|g'*(o)-g”'^*'(a>  I ^ - |F(o,2**(a),u(a))-F(o,z*^'^*^(o),u(a))  I 


< {M^(o)+M2(o)  |u(o)  I }|z”(o)-z**'*'’'(o)  I 


where  Letting  Jt^ia)  s m exp ( 8t^)  {H^  (o) +M2  (o)  | u (o)  | ) 

we  are  thus  able  to  rewrite  the  inequality  (8)  as 

t 

|z*'(t)-z**'^''(t)  1 < e(N,K)  + I |z**(o)-z**'^*'(o)  )do. 


I 


18 


Once  again  appealing  to  the  Gronwall  inequality,  we  conclude 

t 

|z^(t)-z^'’'’^(t)  I < t(N,K)  expjj  (a)do| . 

o 

It  follows  immediately  that  {z*^}  is  Cauchy  in  S^(  [0,  t^^I,  Z)  , 
uniformly  in  u € 

Theorem  3.1.  Let  z,z^  be  defined  by  (6),  (7)  and  let 
!l^CL2(0,t^)  be  bounded.  Then  z^(t;u)  z(t;u)  as  N ® , uniformly 
in  t on  [0,tj^l  and  in  u € . 

Proof ; From  the  previous  lemma  we  know  that  there  exists 
zG^(  [0,tj^]  ,Z)  satisfying  z*^(t;u)  z(t;u)  uniformly  in  t and  in 
u€if.  We  claim  that  this  limit  function  is  the  unique  solution 
in  5r(l0,t^l,Z)  of  (6)  guaranteed  to  exist  by  Lemma  2.1.  This 
follows  immediately  upon  passing  to  the  limit  in  (7)  once  one 
has  observed  that  F (o,  z^  (o)  ,u  (o) ) -*•  F(o,  z (o)  ,u  (a)  ) and  S^(t) 
converges  strongly  to  S(t)  with  both  sequences  being  dominated. 


To  make  practical  use  of  the  convergence  results  of  Theorem  3.1 
in  optimization  problems,  one  must  argue  a little  more  than  is 
promised  in  this  theorem.  More  precisely,  one  wishes  to  replace 
the  control  system  (1)  by  an  equivalent  system  (6)  and  then 
solve  the  optimization  problem  governed  by  (6)  (see  Refs.  3 and  4). 
To  obtain  an  approximate  solution,  one  solves  the  optimization 
problem  subject  to  (7) . This  results  in  a sequence  {u**}  of 
(hopefully)  approximating  optimal  controls  and  one  desires,  of 
course,  that  z**(t>u^)  ■*  z(t;u)  where  u is  the  solution  to  the 
optimization  problem  constrained  by  (6).  The  sequence  {u**}  is 
often  (e.g. , see  Ref.  4)  bomded  In  L2(0,tj^)  so  that  weak  convergence  of 
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{u^}  (or  of  a subsequence  at  least)  to  u can  be  argued.  From 
the  inequality 

I z^(t;u*^)-z  (t;u)  I < | z** (t;u^) -z  (t;u^)  | + ( z (t; u^) -z  (t ;u)  ( 

and  the  convergence  results  of  Theorem  3.1,  it  clearly  suffices 

V 

for  our  purposes  to  establish  that  z(t;u  ) ■+  z(t;u)  whenever 
K in 

u — ^ u in  L2(0,tj^).  However  it  is  not  obvious  from  the  implicit 

defining  equation  (6)  for  z that  the  dependence  of  z on  u is 

even  linear  or  affine  and  thus  it  is  not  surprising  that  further 

assumptions  on  f are  required  to  obtain  the  desired  results.  For 

nonlinear  systems  which  are  affine  in  the  control  terms  we  can, 

K K 

as  might  be  expected,  show  that  u — u implies  z(t;u  ) z(t;u). 

Assumption  3.1.  Suppose  that  f ;R^^B^l2(-r,0)>«Ff''-»- has  the  form 

f(t,y,il),v)  =>^(t,y,ip)  + { B(t)}v 

where  B is  continuous  and  R*^,  >#f:R^xz  ->■  catisfv: 

1 ' 2 n,m  ^ 

(Al)  The  mappings  (t,y,i|<)  (t,y,ij;) , i=l,2,  are  continuous. 

(A2)  For  ^ a bounded  subset  of  Z,  there  exist  functions 

k^,k2  (depending  possibly  on  such  that 

U.  (t,y,4')-  >^(t,x,<|>)  1<  k^(t)  I (x,<)))-(y,4») 

for  all  (x,(>),  (y,4»)6^,  tGR^,  i»l,2. 


(A3)  >f'^(t,0,0)  “ 0,  i-1,2  and  there  exist  functions  k^,k2  in 
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I < k^(t)  I (y,il/)  I2 

for  all  (y,ip)  GZ  with  | (y,i|>)  | sufficiently  large,  i=l,2. 

It  is  not  difficult  to  verify  that  any  f satisfying 
Assumption  3.1  will  also  satisfy  hypotheses  (Hl)-(H4)  of  section  2. 

Theorem  3.2.  Suppose  f satisfies  Assumption  3.1  and  z is  defined 
for  u€L2(0,t2)  by  (6).  Then  u^— ^u  in  L2(0,tj^)  implies 

V 

z(t;u  ) -*•  z(t;u)  as  K . 

Proof:  We  first  observe  that  {u  } bounded  in  L2  along  with 
condition  (G)  and  Gronwall's  inequality  (again!)  imply  that 
{z(t;u*^)  |t  G I0,tj^]  , K=1,2,...}  lies  in  a bounded  subset  of  Z 
(see  the  proof  of  Lemma  2.2)  so  that  we  can  employ  the  local 
Lipschitz  conditions  of  (A2)  in  our  arguments.  Letting 
-^J^(t,z)  i (^(t,Trj^z,TT2Z)  ,0)  , i^(t,z)v  i (3^(t,Trj^z,v^z)v,0), 

tr 

Zj^(t)  = z(t;u  ),  and  z(t)  = z(t;u),  we  have 
I Zj^(t)-Z  (t)  I 

S (t-o)|/f'j^  (a,  Zj^  (o)  )-'^  (o,  z (a) ) +^(o,  Zj^  (a)  )u^  (a)  ->^(o,  z (o)  )u  (a)  jdo 
M exp  1 3 (t-o)  ] (o)  I Zj^  (a) -z  (o)  (do 

» t 

S(t-o)  I (y(a,Zj^(o)  )-^(a,z(a) ) ]u^  (a) +^(a,  z (a) ) Iu*^(o)-u(o)  ijdoj 
M exp(8t)  {kj(o)+k2  (o)  ^*^(0)  I ) | Zj^ (o) -z  (o)  |da 
S(t-a)  (Aj^(a)  ,0)do|  , 
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where  Aj^(a)  = {yf^(a,TTj^z(a)  ,11^2(0) ) + B (a)  } (u^  (a) -u  (a)  } . 

But  from  Theorem  3.2  of  Ref.  4 we  find  that  the  operator 

-*■  5f(  [0, tj^l  ,Z)  given  by 
t 

5/’{q)  (t)  = I S (t-o)  (g  (o)  , 0)da 
o 

is  a compact  linear  operator;  thus  — ^0  in  L^COrtj^)  implies 

(t)  -*•  0 uniformly  in  t on  [0,tj^l.  It  follows  that 
t 

1 Zj^(t)-2  (t)  I 1 + I M sxp  (ptj^)  (kj^  (a) +k2  (O)  |u^  (o)  I } I Zj^(o) -2  (0)  |do 

o 

where  r 0 as  K-*“  , so  that  one  final  application  of  the  Gronwall 
inequality  yields  1 

I Zj^(t)-z  (t)  I ^ej^expjj  >fj^(o)do  I 

o 

and  the  right  side  of  this  inequality  0 as  K , uniformly 
in  t on  [0,  tj^l  . 

■f 

t 

I 

! 

1 

I 

t 
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4 . Numerical  Results 

N •»  N N 

One  specific  choice  of  {S  (t)},  Z , P in  the  approximation 
scheme  described  in  section  3 leads  to  the  so-called  "averaging" 
approximations  that  we  have  discussed  in  some  detail  in  previous 
efforts  dealing  with  linear  systems  (see  Refs.  3 and  4) . We  turn  now 


to  a summary  of  some  of  our  numerical  results  for  nonlinear 
systems  obtained  using  these  averaging  approximations  in  connection 
with  the  general  nonlinear  approximation  scheme  developed  above. 

We  first  recall  briefly  the  specific  class  of  approximations 
of  interest  to  us  here.  We  do  this  for  the  linear  operator  L 
representing  a simple  differential-difference  equation  (for  a 
discussion  of  the  general  linear  case,  see  Ref.  3)  given  by 

L((^)  = A^((>(0)  + . (9) 


For  each  positive  integer  N,  let  {t^}  be  the  partition  of 

[-r,0]  given  by  t ^ , j=0 , 1 , 2 , . . . ,N,  and  define  Xj 

N N 

characteristic  function  of  (tj,tj_j^)  for  j=2,...,N  and  that  of 
(tJ,tQ]  = [-  g ,0]  for  j=l.  Let  = { (n,<0)  | n Gr", 

'I'  = Z v.x^,  V.  €R  } and  P (n,'(i)  = (n,  Z where 

i=l  J J J 1 J J 

.N 

j * p I ((>  (s)ds. 


(fi  (s)ds. 


N ^N 

Define  the  semigroups  S (t)  * exp  (Ur  t)  where  Ur  :z  ->  Z is  given  by 
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N 

with  (|)^  E n.  It  is  shown  in  Ref.  3 that  these  choices  lead  to 
a scheme  satisfying  the  convergence  requirements  of  section  3. 

Consider  next  application  of  these  ideas  to  systems  (1) 
with  L given  by  (9)  and  n = (j)(0).  That  is, 


x(t)  = A^x (t) +A2X (t-r) +f (t,x (t) ,x^,u (t) ) 
x(0)  = H/  x^  = 4)  . 


(10) 


Defining  the  functions  e^  = (1,0),  e^  = (0,Xj)»  j=l,2,...,N, 

M 

and  letting  z (t)  in  (7)  be  represented  by 

z*^{t)  = ^ w^(t)e^,  where  w^(t)ER*',  it  is  not  difficult  to  argue 

j=o  ^ ^ ^ 

that  the  approximating  system  (7)  for  (10)  can  be  written  as  an 
ordinary  differential  equation  for  the  "Fourier"  coefficients  w^. 
Thus,  the  approximation  to  (10)  is  equivalent  to  (see  Ref.  4) 

^o^^’  = A^w”+Aj^w5J(t)+f  (t,w^(t),  f w^(t)Xj,u(t)) 

Wj(t)  = p (t) -Wj  (t)  } , j=l,2,...,N,  (11) 

w”(0)  = col((^(0),<^5’,  ...,<^JJ). 


For  f satisfying  the  hypotheses  of  section  2,  (4*  (0)  ,4')  6 and  u E .5^, 

the  convergence  guaranteed  by  Theorem  3.1  implies  that  w^(t)-^x(t) 

uniformly  in  t on  [0,t,  ] and  in  uS5^,  where  w^,x  are  the  solutions  of 

1 o 

(11)  and  (10)  respectively.  We  first  present  results  for  an 
example  which  illustrates  this  convergence  of  w^  to  x in  a system 
not  subject  to  optimization. 
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Example  4.1.  Consider 

x(t)  = -1 . 5x (t) -1 . 25x (t-1) +x (t) sin  x(t),  t > 0 
x(0)  = 100+1,  -1  < 0 < 0, 

with  corresponding  approximating  equations 

w^(t)  = -1.5w^(t)-1.25wJ|J(t)+w^(t)sin  w^(t) 

Wj(t)  = (t) -Wj  (t)  } , j=l,2,...,N. 

N 

In  Table  1 we  list  selected  values  for  the  variables  w corre- 

o 

spending  to  several  values  of  N.  These  are  compared  to  the  "true" 

solution  values  x which  were  obtained  using  one  of  the  block 

methods  (of  fourth  order)  developed  by  Tavernini  (see  Ref.  10,  pp.  77-78) 


directly  on  the  delay  system. 

Table  1.  Results  for  Example  4.1. 


Time 

w8 

o 

16 

w 

o 

20 

w32 

o 

X 

0.  0 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

in 

• 

o 

3.1589 

3.2363 

3.2519 

3.2754 

3.2532 

1.0 

2.2586 

2.2808 

2.2871 

2.2989 

2.3856 

1.5 

0.6754 

0.4929 

0.4488 

0.3755 

0.3446 

2.0 

-0.6484 

-0.8157 

-0.8503 

-0.9026 

-0.9137 

2.5 

-0.6967 

-0.7269 

-0.7312 

-0.7363 

-0.7655 

3.0 

-0.2656 

-0.1766 

-0.1524 

-0.1106 

-0.1060 

3.5 

0.1109 

0.2418 

0.2737 

0.3257 

0.3558 

4.0 

0.2454 

0.3286 

0.3463 

0.3735 

0.4017 

4.5 

0.1626 

0.1547 

0.1492 

0.1377 

0.1426 

5.0 

0.0137 

-0.0478 

-0.0644 

-0.0923 

-0.1054 
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The  above  example  typifies  some  of  the  worst  convergence 
behavior  we  have  observed  in  using  the  averaging  approximations 
to  integrate  nonlinear  equations.  In  our  efforts  at  Brown  and 
in  numerical  work  in  collaboration  with  F.  Kappel  and  H.  Froschl 
of  the  University  of  Graz,  we  have  applied  these  approximations 
to  a number  of  other  nonlinear  FDE  examples  (2  and  3-dimensional 
vector  equations  and  other  types  of  nonlinearities  typical  of 
those  arising  in  several  areas  of  applications) . In  these 
instances  we  found  convergence  of  the  approximations  no  worse  than 
that  illustrated  in  Example  4.1  and  in  a majority  of  the  cases 
convergence  was  actually  much  more  rapid  than  in  the  example 
detailed  above. 

We  have  also  used  the  "averaging"  approximations  to  solve  for 
optimal  controls  in  problems  governed  by  nonlinear  delay-differ- 
ential control  systems.  Typical  examples  of  interest  are  of  the 
form  (for  more  general  problems  to  which  our  framework  in  sections  2 

and  3 is  applicable,  see  Refs.  3 and  4) ; 

^1 

Minimize  J = j x ( tj^)  Gx  (tj^)  j | (x  (t)  Qx  (t) +u  (t)  Ru  (t)  }dt 

o 

subject  to  (10)  where  u€'3r,  ^ is  a closed  convex  subset  of 

L2(0,ti).  Here  we  assume  G,Q  ^0,  R > 0. 

The  approximating  problems  (for  the  averaging  approximations) 

are  given  by:  . 

'l 

Minimize  j ''o  ^^1^  ®''o  ^^1^  W {w^  (t)  Qw^  (t) +u  (t)  Ru  (t)  }dt 

o 


subject  to  (11)  where  eigain  uG^. 
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Let  u be  a solution  of  the  approximating  problem  (while 

standard  existence  arguments  are  applicable  here,  the  usual 

. . N . 

uniqueness  assertions  are  not  appropriate  since  J is  not 

necessarily  even  convex  in  the  control  u)  with  index  N.  Denote 

_M  _M  _N  N — N 

by  X the  corresponding  trajectory  and  J = J (u  ) the 

corresponding  value  of  the  cost  functional.  Then  under 

Assumption  3.1  of  section  3,  Theorems  3.1  and  3.2  along  with  standard 

arguments  (see  Theorem  4.1  of  Ref.  3)  can  be  employed  to  conclude 

that  a subsequence  of  { (u^,x*^)  } converges  to  a solution  (u*,x*) 

of  the  original  problem  governed  by  the  system  (10)  (the  sequence 

{ (u^,j^) } itself  converges  if  the  solution  (u*,x*)  of  the  original 

problem  is  unique) . We  present  results  for  several  examples  to 

illustrate  this  use  of  the  approximating  scheme  in  such 

optimization  problems. 

Example  4.2.  Take  as  the  original  problem  the  task  of  minimizing 

2 

J = I x(2)^  + I I u(t)^dt 
o 

over  u = L2(0,2)  subject  to 


x(t)  = sin  x(t)+x(t-l)+u(t) , 0 £ t £ 2, 

x(0)«l,  -1<0<O. 


(12) 


of 


The  corresponding  approximating  problem  entails  minimization 


.N 


jlw^(2))^  + J I u(t)^dt 
o 
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over  the  same  class  of  controls  but  subject  to 
w^(t)  = sin  w^(t)  + wJJ(t)  + u{t) 

Wj(t)  = N{Wj_^(t)-  w^(t) } j=l,2,...,N,  (13) 

w”(0)  = col (1,1, ... ,1) . 

One  can  apply  the  necessary  condition  for  nonlinear  differential 
equation  control  problems  along  with  standard  computational 
routines  (gradient,  conjugate-gradient)  - see  Ref.  11  and  Chapter  X 
of  Ref.  12  - to  obtain  extremals  (u*^,w^)  for  the  approximating  problems 
This  was  done  (in  a manner  described  in  more  detail  below)  to 
obtain  the  values  reported  in  the  tables  presented  in  this  paper. 

One  can,  on  the  other  hand,  use  the  necessary  conditions  for 
optimality  of  delay  system  problems  (see  Ref.  13,  Theorem  VII. 2. 31) 
and  worlc  directly  with  the  original  optimization  problem.  If 
(u*,x*)  is  a solution  of  this  problem,  these  necessary  conditions 
guarantee  the  existence  of  a multiplier  X such  that  u*  maximizes 
H = - j u + X{sin  x*+x* (t-1) +u}  (i.e.  u*=X)  where  X satisfies 

X(t)  - -X(t)cos  x*(t)-X(t+l) , 0 < t < 2, 

X(t)  =0,  t > 2,  (14) 

X(2)  - -x*(2), 

with  X*  the  solution  of  (12)  corresponding  to  u*u*»X.  Thus,  we 
see  that  direct  solution  of  the  minimization  problem  for  extremal 
pairs  (u*,x*)  requires  solution  of  the  mixed  advanced-delayed 
system  two-point  boundary  value  problem  (TPDVP)  consisting  of  (12) 
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r 

(with  u*\)  and  (14).  In  the  results  listed  below  (for  this 

and  other  examples) , we  have  given  values  for  an  approximate 

numerical  solution  of  the  associated  TPBVP  under  the  columns 

headed  x*  and  . These  values  can  be  used  as  a rough  but 

— N 

independent  chec)t  on  the  convergence  of  the  trajectories  w^  and 
controls  u^  where  (u*^,w*^)  are  the  computed  extremal  pairs  for 
the  approximating  problems.  In  Table  2 we  give  a sujiunary  of 
selected  values  for  the  controls  and  in  Table  3 we  give 
corresponding  trajectory  values  along  with  the  optimal  payoff 
values  J . For  this  example  we  see  that  the  approximate 
solution  for  N=4  is  a reasonably  good  one. 


Table  2.  Results  for  Example  4.2. 


Time 

-4 

u 

-20 

u 

X * s u* 

0.00 

-3.5737 

-3.5704 

-3.5663 

-3.5686 

0.25 

-2.7615 

-2.7706 

-2.7769 

-2.7779 

0.50 

-2.0473 

-2.0594 

-2.0664 

-2.0561 

0.75 

-1.4938 

-1.5052 

-1.5073 

-1.4849 

1.00 

-1.0889 

-1.1018 

-1.1041 

-1.0577 

1.25 

-0.8030 

-0.8226 

-0.8371 

-0.8197 

1.50 

-0.6067 

-0.6337 

-0.6558 

-0.6400 

1.75 

-0.4722 

-0.4994 

-0.5180 

-0.5010 

2.00 

-0.3751 

-0.3981 

-0.4112 

-0.3938 

Table  3.  Results  for  Example  4.2. 


Time 

-4 

w 

o 

-8 

w 

o 

X* 

0.00 

1.0000 

1.0000 

1.0000 

1.0000 

0.25 

0.6381 

0.6372 

0.6367 

0.6270 

0.50 

0.4098 

0.4081 

0.4059 

0.3880 

0.75 

0.2882 

0.2932 

0.2948 

0.2743 

1.00 

0.2396 

0.2581 

0.2749 

0.2747 

1.25 

0.2360 

0.2661 

0.2961 

0.3172 

1.50 

0.2599 

0.2932 

0.3231 

0.3415 

1.75 

0.3043 

0.3336 

0.3558 

0.3672 

2.00 

0.3709 

0.3939 

0.4099 

0.4187 

T”  2.8759 


2.9176 


2.9437 


I 


i 


, : 
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Example  4.3.  In  this  example  and  the  next,  we  choose  a system 
with  nonlinearity  satisfying  a local  but  not  global  Lipschitz 
condition.  In  the  case  where  the  approximation  to  the  initial 
data  is  good,  the  approximation  method  is  quite  good  for  low 
values  of  N.  The  problem  is  to  minimize 


2 

J = j x(2)^  + ^ I {x(t)^+u(t)2}dt 
o 

over  uE‘3f=  L2(0,2)  subject  to 

x(t)  = x(t)sin  x (t) +x (t-1) +u (t) , 0 £ t £ 2, 
x(0)  = 10,  -1  < 0 < 0. 


(15) 


The  approximating  problem  is  one  of  minimizing 

= 7[w^(2)1^  + ^ I {w”(t)^  + u(t)^}dt 
o 

over  but  subject  to 

w^(t)  = w^(t)8in  w^(t)+w^(t)+u(t) 

w”(t)  - N{w”_j(t)-w”(t) } j»l,2,...,N,  (16) 

w”(0)  « col(10,...,10). 

Q — fl 

In  Table  4 we  give  values  for  w^,  u and  x*,A*  only  since  this 
suffices  to  demonstrate  the  approximation  scheme  as  applied  to 
this  example.  We  also  made  computations  for  N » 16  and  32  and 
obtained  values  for  that  differed  only  slightly 

—8*  -ft 

from  and  x*,u*.  Values  for  the  payoff  %fere 
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J®  = 162.019,  = 162.018,  = 162.015.  Graphs  of  u®  and  u* 

reveal  strikingly  good  qualitative  agreement  between  these  two 
functions  (see  Figure  1) . 


Table  4.  Results  for  Example  4.3. 


Time 

— 8 
w 
o 

X* 

0.00 

-1.4959 

-1.5159 

10.0000 

10.0000 

0.25 

-1.7215 

-1.7275 

10.3038 

10.2700 

0.50 

-1.7683 

-1.8690 

10.3399 

10.3214 

0.75 

-1.7787 

-2.0592 

10.3470 

10.3164 

1.00 

-1.7465 

-1.6550 

10.3589 

10.3184 

1.25 

-1.6812 

-1.6677 

10.3790 

10.3750 

1.50 

-1.7365 

-2.0244 

10.3927 

10.3770 

1.75 

-2.5597 

-3.3634 

10.3485 

10.2916 

2.00 

-9.8694 

-9.9181 

9.9181 

9.9174 

I 
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1 

I 


I 


Example  4.4.  This  example  is  exactly  the  same  as  that  in 
Example  4.3  except  for  a change  in  initial  data.  In  (15)  we 
choose  initial  function  x^  = 4)  where 


♦ (0) 


lO(d-hl) 

-100 


-1  < e < -.5 


.5  < e < 0 


with  a corresponding  change  in  (16)  given  by 
w*^(0)  = col((K0)  ,4.”,  ...,(^jj)  . 

Our  numerical  results  are  presented  in  Tables  5 and  6.  The 
approximate  numerical  solution  X*,x*  to  the  TPBVP  for  the 
original  problem  is  not  a particularly  good  one  (one  should  have 
X*(2)  = -x*(2)  and  we  have  a relative  error  here  of  about  5%) 
but  it  is  adequate  for  an  independent  checlc  that  the  method  is 
producing  solutions  converging  to  an  extremal  for  the  original 
problem. 

The  averaging  method  does  not  approximate  this  initial  data 
particularly  well  for  low  values  of  N and  as  one  might  expect, 
somewhat  higher  values  of  N are  required  to  obtain  an  approximate 
solution  of  accuracy  comparable  to  that  in  the  previous  example. 
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Table  5.  Results  for  Example  4.4. 


Time 

58 

0^2 

-48 

u 

X*=u* 

0.00 

-2.3083 

-2.3073 

-2.3032 

-2.3015 

-2.3321 

0.25 

-2.1804 

-2.2375 

-2.2706 

-2.2833 

-2.3238 

0.50 

-1.9855 

-2.1114 

-2.1972 

-2.2316 

-2.2768 

0.75 

-1.5751 

-1.6470 

-1.6908 

-1.7064 

-1.7374 

1.00 

-1.1446 

-1.1417 

-1.1318 

-1.1252 

-1.1245 

1.25 

-0.8173 

-0.7891 

-0.7697 

-0.7622 

-0.7702 

1.50 

-0.5967 

-0.5761 

-0.5667 

-0.5642 

-0.5691 

1.75 

-0.4401 

-0.4321 

-0.4327 

-0.4347 

-0.4374 

2.00 

-0.3125 

-0.3098 

-0.3147 

-0.3162 

-0.3144 

Table  6.  Results  for  Example  4.4. 


T ime 

-8 

-16 

-32 

-48 

w 

0 

''o 

''0 

X* 

0.00 

0.0000 

0.0000 

0.0000 

0.0000 

0 . 0000 

0.25 

-0.1062 

-0.1684 

-0.2056 

-0.2188 

-0.2506 

0.50 

0 .1351 

0 .1308 

0.1245 

0.1209 

0.1086 

0.75 

0 .3456 

0 .4290 

0.4917 

0.5167 

0.5831 

1.00 

0 .4120 

0 .4976 

0.5550 

0.5767 

0.6406 

1.25 

0 .3805 

0 .4130 

0.4232 

0.4244 

0.4286 

1.50 

0 .3269 

0 .3161 

0.2966 

0.2869 

0.2586 

1.75 

0 .2988 

0 .2803 

0.2644 

0.2578 

0.2421 

2.00 

0 .3106 

0.3078 

0.3115 

0.3144 

0.3313 

2.1765 

2.3105 

2.4012 

2.4371 
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All  computations  for  the  numerical  results  presented  in 
this  paper  were  carried  out  on  the  IBM  360/67  at  Brown  University. 
We  are  grateful  to  Mr.  Douglas  Reber  for  his  assistance  in  making 
these  computations.  Indeed  Mr.  Reber  developed  with  care  and 
patience  the  software  packages  that  were  used  for  the  examples 
discussed  above  as  well  as  for  some  preliminary  numerical  studies 
6.n  our  joint  efforts  with  F.  Kappel)  of  other  examples  involving 
use  of  several  spline  type  approximations  in  an  abstract  frame- 
work similar  to  that  discussed  in  sections  2 and  3 above  and  Ref.  4. 
We  give  now  a brief  description  of  the  computational  procedures 
followed  in  producing  the  results  detailed  above. 

We  employed  an  iterative  optimization  scheme  that  combined 

gradient  (G)  and  conjugate-gradient  (CG)  steps  in  computing  the 
— N — N 

extremals  (u  ,w  ) for  the  approximating  ODE  problems.  For  the 
results  reported  above,  the  first,  fourth,  seventh,  etc. 
iterative  steps  were  G steps  while  the  remaining  steps  were  CG 
steps  formulated  (a  continuous  problem  version)  according  to  the 


scheme  given  in  Chapter  X of  Ref.  12.  Integration  of  all  ODE's  within 
these  steps  was  performed  by  a standard  modification  (Gill's)  of  a 
fourth-order  Runge-Kutta  method. 

Iteration  was  continued  until  a convergence  criteria  (for 
changes  in  J ) was  satisfied.  In  the  examples  above  this  resulted 
In  at  most  10  iterative  steps  in  each  computation  of  a (u  ,w  ) 
for  Example  4.2,  at  most  5 steps  in  those  of  Example  4.3  and  at 
most  13  steps  In  Example  4.4.  CPU  time  required  for  these 
computations  in  the  optimization  package  ran  from  82  seconds  for 
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in  Example  4.4.  As  one  might  expect,  higher  dimensional 
examples  require  increasingly  higher  amounts  of  CPU  time. 

Integration  of  the  X and  x equations  in  the  TPBVP's  for 
the  original  problems  (e.g.,  see  (14))  was  carried  out  via 
the  same  block  method  used  in  integration  of  the  delay  system 
in  Example  4.1. 

Our  experience  with  examples  suggests  that  when  employed 
with  a reasonably  efficient  optimization  scheme,  the  averaging 
approximations  for  nonlinear  control  problems  enjoy  about  the 
same  computational  behavior  as  that  already  demonstrated  by 
extensive  studies  on  use  of  this  method  for  linear  control 
problems  (see  Refs.  3 and  14). 


i 'V 
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5.  Concluding  Remarks 

The  efforts  involved  in  implementation  and  the  convergence 
results  obtained  in  practice  with  a number  of  examples  lead  us 
to  believe  that  the  approximation  ideas  of  sections  2 and  3 are  indeed 
reasonable  possibilities  when  one  is  optimizing  delay  system 
problems.  The  framework  developed  here  is  an  effective  means  of 
reducing  the  difficulties  associated  with  certain  nonlinear  FDE 
control  problems  to  those  for  nonlinear  ODE  control  problems. 

While  there  are  still  a number  of  difficulties  associated  with 
these  latter  problems,  a great  deal  more  in  the  way  of  specific 
techniques  is  available  in  the  literature  for  these  problems 
than  in  the  case  of  FDE  problems. 

The  assumptions  underlying  development  of  the  ideas  in  section  2 
are  not  so  restrictive  as  to  preclude  treatment  of  nonlinear 
systems  of  a type  arising  frequently  in  applications.  Assumption 
(H2)  is  a local  Lipschitz  condition  in  (x(t),x^)  in  the  Z-norm 
while  (H4)  is  a growth  condition  not  unlike  those  often  imposed 
in  ODE  theory  when  one  wishes  to  guarantee  existence  of  solutions 
on  any  fixed  finite  interval.  Among  nonlinearities  that  fall 
within  the  restrictions  of  (H1)-(H4)  are  those  (see  Ref.  2)  arising 
in  enzyme  kinetics,  certain  nonlinear  protein 

synthesis  models,  particle  accelerator  models  and 

bilinear  control  systems.  Also  included  are  nonlinearities 
involving  terms  of  the  form 

0 

Viyi(t)  j y2(t+0)Y(0)d9 

f(t,y(t),y^)  - 

K + I y2(t+0)Y(0)d0 
-r 


which  have  been  employed  in  vector  system  models  (y=(yj^»y2)) 
for  low  growth  chemostats  (Ref.  15) . We  are  in  fact  currently  using 
some  of  the  approximation  ideas  from  this  paper  to  study  the 
qualitative  properties  of  such  chemostat  models. 


38 


P 

References 

1.  BANKS,  H.T.,  Approximation  Methods  for  Optimal  Control  Problems 
with  Delay-Differential  Systems,  Seminaires  IRIA:  Analyse  et 
Controle  de  Systemes,  pp.  5-19,  1976. 

2.  BANKS,  H.T.,  Delay  Systems  in  Biological  Models;  Approximation 
Techniques, 

Nonlinear  Systems  and  Applications, 

Edited  by  V.  Lakshnikantham,  Academic  Press, 
New  York,  New  York,  pp.  21-38,  1977. 

3.  BANKS,  H.T.,and  BURNS,  J. A.,  Hereditary  Control  Problems; 

^ Numerical  Methods  Based  on  Averaging  Approximations,  SIAM  Journal 

on  Control  and  Optimization  (to  appear) . 

4.  BANKS,  H.T. , and  BURNS,  J.A. , An  Abstract  Framework  for 
Approximate  Solutions  to  Optimal  Control  Problems  Governed  by 
Hereditary  Systems,  International  Conference  on  Differentia) 
Equations,  Edited  by  H.A.  Antosiewicz,  Academic  Press,  New  York, 

New  York,  pp.  10-25,  1975. 

5.  KAPPEL,  F. , and  SCHAPPACHER,  W. , Autonomous  Nonlinear  Functional 

f;  Differential  Equations  and  Averaging  Approximations  (to  appear) . 

6.  REDDIEN,  G.W. , and  WEBB,  G.F. , Numerical  Approximation  of  Nonlinear 

j Functional  Differential  Equations  with  1^  Initial  Functions 

I (to  appear) . 

I 7.  REPIN,  M. , On  the  Approximate  Replacement  of  Systems  with  Lag  by 

‘ j Ordinary  Differential  Equations,  Journal  of  Applied  Mathematics 


39 


8.  SASAl , H.,  and  ISHIGASKI,  H. , Convergence  of  Approximate 
Solutions  to  Functional  Differential  Systems  by  Projection 
Methods  (to  appear) . 

9.  SASAI , H. , Approximation  of  Optimal  Control  Problems  Governed 
by  Nonlinear  Evolution  Equations  (to  appear). 

10.  CRYER,  C.W. , Numerical  Methods  for  Functional  Differential 
Equations , Delay  and  Functional  Differential  Equations  and 
Their  Applications,  Edited  by  K.  Schmitt,  Academic  Press, 

New  York,  pp.  17-101,  1972. 

11.  LUENBERGER,  D.G.,  Optimization  by  Vector  Space  Methods, 

John  Wiley  and  Sons,  New  York,  New  York,  1969. 

12.  BANKS,  H.T.  and  PALATT,  P.J.,  Mathematical  Modeling  in  the 
Biological  Sciences,  Brown  University,  LCDS  Lecture  Notes  75-1, 
1975. 

13.  NEUSTADT,  L.W. , Optimization;  A Theory  of  Necessary  Conditions, 
Princeton  University  Press,  Princeton,  New  Jersey,  1976. 

14.  BANKS,  H.T. , BURNS,  J.A. , CLIFF,  E.M. , and  THRIFT,  P.R. , 
Numerical  Solutions  of  Hereditary  Control  Problems  Via  an 
Approximation  Technique,  Brown  University,  LCDS  Technical 
Report  no.  75-6,  1975. 

15.  CAPERON,  J. , Time  Lag  in  Population  Growth  Response  of 
Isochrysis  Galbana  to  a Variable  Nitrate  Environment , Ecology , 


Vol.  50,  pp.  188-192,  1969 


